home *** CD-ROM | disk | FTP | other *** search
Text File | 1995-07-31 | 5.5 KB | 96 lines | [TEXT/QED1] |
- AA Bta1 Bta2:MT Bta3:16338 Bta4:Bos taurus Bta5 Bta6 Dya1 Dya2:MT Dya3:16019 Dya4:Drosophila yakuba Dya5 Dya6 ZA ZB ZC
- AT6 15.93 20.08 7 + 8290 8967 16.18 20.38 7 + 4066 4739 AT 6 ATPase 6
- AT8 14.95 16.15 6 + 8129 8326 15.21 16.22 6 + 3911 4072 AT 8 ATPase 8
- CO1 0 9.43 1 + 5687 7228 0 9.58 1 + 1474 3009 CO 1 CO I
- CO2 10.33 14.49 4 + 7374 8054 10.04 14.31 3 + 3083 3767 CO 2 CO II
- CO3 20.09 24.88 8 + 8970 9752 20.39 25.31 8 + 4740 5528 CO 3 CO III
- CYB 54.03 60.98 20 + 14514 15650 56.44 63.53 24 + 10515 11651 CY B Cyt B
- ND1 84.17 90 28 + 3101 4054 63.87 69.95 26 - 11706 12680 ND 1 ND 1
- TRS 9.44 9.87 2 - 7229 7299 63.57 63.98 25 + 11658 11723 TR S Ser tRNA (UGA-UCN)
- TRT 61.02 61.45 21 + 15657 15726 52.3 52.7 21 + 9852 9916 TR T Thr tRNA
- TRV 73.67 74.08 25 + 1386 1452 78.71 79.15 29 - 14082 14153 TR V Val tRNA
- TRW 97.68 98.08 33 + 5308 5374 98.71 99.12 35 + 1267 1333 TR W Trp tRNA
- TRX 38.87 39.3 16 + 12038 12108 70.02 70.42 27 - 12691 12755 TR X Leu tRNA (UAG-CUN)
- TRY 99.58 99.99 37 - 5618 5685 99.54 99.96 37 - 1401 1468 TR Y Tyr tRNA
- TRZ 38.5 38.86 15 + 11977 12036 29.5 29.92 14 + 6200 6267 TR Z Ser tRNA (GCU-AGY)
-
-
- Above is a small portion of the animal mtDNA gene map matrix, to illustrate the
- format required. Try to follow this format as closely as possible, for reasons
- that are mostly obscure and related to the creation of gene maps in DNA Translator.
- and sorting capabilities of Excel. There may be a more logical or more efficient format,
- but these might require rewrites of scripts. The following discussion assumes
- that Microsoft Excel is being used to maintain the database. Other spreadsheet
- programs should work equally well. The files should be saved as "Text Only"
- (tab delineated) or "CSV" (comma delineated) before import to DNA Translator,
- but you may find it wise to also save your matrix in Excel format. Note that
- you can control the number format of particular columns if you are satisfied with,
- e.g., 2 or 3 digit accuracy, but once saved as text the original accuracy is lost.
-
- The first line is a header line, which includes important information about included
- molecules as well as a format which can be sorted for generation of gene order
- information (Eernisse, in prep.). The first and last three columns are named for
- sorting alphabetically, and these columns contain, respectively: 1) gene or feature
- short abbrev. (button name on gene map with certain exceptions, i.e., tRNAs);
- 2) same as 1st 1 to n-1 characters of abbreviation; 3) last character of abbreviation;
- 4) full or shortened gene name that you wish to display. In between these columns
- are the gene data columns for 1 or more molecules, six per molecule. The header column
- is named in a set format, e.g., "Bta1,Bta2:MT,Bta3:16338,Bta4:Bos taurus,Bta5,Bta6"
- here tab-delimited (comma-delimited should work as well). "Bta" is derived from the
- first letter of the genus name and first two letters of the species name, followed by
- the number of the column. Columns 2-4 are appended by important information. If the
- molecule is mitochondrial or chloroplast DNA, column 2 (Bta2) should be appended by
- ":MT" or ":CP". Append column 3 (Bta3) by the entire length of the molecule (important).
- Append column 4 (Bta4) by the complete "Genus species" name the molecule was extracted
- from (important).
-
- [7/31/95: Above is still more or less current except that I now store additional
- info: Bta5:Accession Number, Bta6:Common Name. The latter is especially advised
- because I am up to 26 animal mtDNA maps. Also, the various taxa are now arranged
- phylogenetically, rather than alphabetically.]
-
- To enter a new molecule in an existing matrix, follow this procedure:
-
- 1a) use GenBank/EMBL sequence import with creation of a "map matrix" on a utility
- card of DNA Translator. This will do most of the work for a single molecule
- map.
-
- or:
-
- 1b) type in the coordinates and features manually in spreadsheet columns, following
- the above example.
-
- 2) open the matrix in something like Excel, and edit the feature names as I have
- done in the above example so that short meaningful button names will be displayed.
-
- 3) if you have more than one molecule that are comparable, you may want to combine
- them as I have above. Repeat step 1 for each, and combine them in a single
- spreadsheet, taking care that the right coordinates are placed with the right
- features (i.e., sort first alphabetically by feature). This will require that
- you recalculate the coordinates with spreadsheet commands, which I can explain
- if anyone out there is interested (be careful to replace the formulas with values
- via "Paste Special" in Excel before sorting, but keep a backup for the next
- molecule that you will add to the gene map).
-
- 4) Note that you may need to do some Excel manipulations so that the lowest
- relative coordinate is always in the left-hand column (e.g., 9.44 is to left
- of 9.87 above), even though the above-mentioned gene map matrix export doesn't
- export them in this way (my fault--I will try to fix this!).
-
- 5) save in "CSV" format (comma-delimited text) or text tab-delimited format. Examine
- first with a text editor and strip quotation marks that Excel may have added.
-
- 6) Create a new gene map from the menu of a gene mapping card in DNA Translator.
-
- 7) Export multiple sequences for some protein-coding gene to Aligner stack. This
- will prompt you for choosing a genetic code for each genome mapped. Thereafter,
- it will be automatic.
-
- To export the current matrix from a gene mapping card, select "Export Map Data"
- from the Edit menu.
-
- Please send suggestions, improvements, new map matrices (i.e., as exported from
- DNA Translator in a file attached to email) to:
-
- DEernisse@fullerton.edu
-